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MEAN VALUES OF MULTIPLICATIVE FUNCTIONS OVER FUNCTION 

FIELDS 

ANDREW GRANVILLE, ADAM J HARPER, AND KANNAN SOUNDARARAJAN 


Abstract. We discuss the mean values of multiplicative functions over function fields. In 
particular, we adapt the authors’ new proof of Halasz’s theorem on mean values to this 
simpler setting. Several of the technical difficulties that arise over the integers disappear in 
the function field setting, which helps bring out more clearly the main ideas of the proofs 
over number fields. We also obtain Lipschitz estimates showing the slow variation of mean 
values of multiplicative functions over function fields, which display some features that are 
not present in the integer situation. 


1. Introduction 

We begin by introducing multiplicative functions over the polynomial ring ¥ q [x], highlight¬ 
ing the analogy with multiplicative functions over the integers. In the subsequent sections 
of the introduction we will discuss the new results in this paper. 


1.1. An introduction to multiplicative functions over function fields. In the poly¬ 
nomial ring ¥ q [x \, where q is a prime power, let A4 denote the set of rnonic polynomials and 
let M. n denote the set of monic polynomials of degree n, so that \M. n \ = q n . Upper case 
letters like F, G shall denote monic polynomials. Let V denote the set of irreducible monic 
polynomials, and V n those of degree n, and we reserve the letter P to denote irreducible 
monic polynomials. We denote the degree of a polynomial F by deg(F). 

We are interested in multiplicative functions / : A4 —>■ C; that is, functions / satisfying 
f(FG) = f(F)f(G) for all coprime monic polynomials F and G. The analogous functions 
over the integers, namely multiplicative functions / : N —> C (that is, functions / with 
f(mn ) = /(m)/(n) for all coprime integers m and n), have been extensively investigated. A 
useful tool in studying multiplicative functions over the integers is the Dirichlet series 


F(s) = 

n= 1 


/W 

71 s 



/(P 2 ) 

rl 2.s 


+ . . . 


> 


where the product is over all primes p, and one usually restricts attention to those multi¬ 
plicative functions for which the series and product are absolutely convergent in Re(s) > 1. 
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Correspondingly, to study multiplicative functions over function fields, we put 


(1.1) T(z) = Y f(F) 


deg (F) _ 


[J (l + f(P)z W) + /(p 2 )P<W) + ..X 


F&M 


where we assume that the series and product converge absolutely in \z\ < 1/q. 

In Section 2 we give several examples of interesting multiplicative functions over ¥ q [x], 
and for a general introduction to number theory over function fields we refer to [ 5 ]. For the 
moment, it may be helpful to consider the most basic example, the multiplicative function 
taking the value 1 on all monic polynomials in F f/ [.x']. Here, in the domain \z\ < 1/q we have 


T(z) = Y = (1 - qzY' = lid 


y deg(P)\-l 


71=0 


which corresponds to the Ricmann zeta-function 


71=1 p 


Taking logarithms above yields that 


E 

771=1 


(qz) 


m 


EE 

P k =1 


z k deg(P) 

k 


EE a < p ‘) 


P k =1 


z deg(P k ) 

deg (P k ) 


_ ~7fc _ 

E-' E A(F), 


71=1 


FeM r , 


where A (F), in analogy with the von Mangoldt function of prime number theory, is defined 
to be zero unless F = P k is the power of an irreducible in which case A (F) = deg(P). 
Equating coefficients implies that 


( 1 . 2 ) 


E A <D = i n • 


F£Mr, 


and now Mobius inversion gives the the well-known “prime number theorem for F„[x] 


\Vn\ = -Y J k(d)q n/d = ?- + 0 

n ^—' n 

d\n 


n /2 


Tl 


The analogous relationship in the integers, namely that Y2 n <x A(u) = x + 0(ic 1 / 2+o( ' 1 ' ) ), is an 
open question, equivalent to the Riemann Hypothesis. 

For a general multiplicative function /, take logarithms in (11.ip . and write 


(1.3) 


iogp(^) = 


F&M 


A/(P) 

deg(P) 


deg(P) 


for certain coefficients A/(P) with A/(P) = 0 unless P is the power of an irreducible. 
Differentiating, we may equivalently write (11.3 1) as 

(1.4) zFz) = ^ A ,(F)z**W, 

F&M 


For a given positive real number n we focus on the class of multiplicative functions C(a) 
consisting of those / for which /(1) = 1 and 


(1.5) 


|A/(F)| < kA(F) 
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for all F. The hypotheses (ll.4p and (11.5p ensure the absolute convergence of the series and 
product in (II. ip for \z\ < 1 /q. 

In [2] we studied the analogous class of multiplicative functions / over the integers for 
which |Aj(n)| < kA( n), where —F'(s)/F(s) =: J^ n >i A/(n)/n s . The bound on |Aj(n)| 
guarantees that the Dirichlet series and Euler product defining F(s) are absolutely convergent 
for Re(s) > 1. 

Given a multiplicative function / in C(k) our aim is to understand (for n > 0) the mean 
value 

(!-6) a(n) = a(n-f) ■= \ ^ f(M) 

Q MeMn 

in terms of the corresponding averages of / over prime powers 

(1.7) x(n) = xU;/)-4 E A /T). 

q FeMn 

We have <r(0) = 1, x(0) = 0 and cr(l) = y(l), and then we observe (this follows from (11.411 . 
and will be justified in Remark 2 in Section [2] below) that 

n 

(1.8) na{n) = y^x(fc)cr(n — k). 

k =1 

With this notation, we may also write (11.ip and (1 1.3 p as 

OO OO / J \ 

(1.9) 1F(V9) = E = ex P ( E A). 

n =0 k =1 

The convolution relation (1 1.8 p is a little more involved in the integer situation. The discrete 
relation (1 1.8 p is replaced by the continuous integral equation ua(u) = x(t)cr(w — t)dt, 
where y (f) = J2 n < y t A/(n) is an average of the multiplicative function / evaluated 

at prime powers (here y is a suitably large parameter), and then er(w) approximates (in many 
situations) the mean-value of the function / evaluated over integers up to y u . Such integral 
equations were first considered by Wirsing, and are discussed further in [3j. 


1.2. Halasz’s Theorem over function fields. In |2] we show, generalizing a little the 
pioneering work of Halasz, that if x is large, and if |A/(n)| < nA(n) for all n then 


(i.io) 1 e/ w«. 

n<x 


logic 


1 / log X 


max 

|t|<(I°gir) K 


F( 1 + a + it) 


1 + a + it 


\ da 

)~ + 


(log logo;) 
logic 


If one inserts a trivial bound \F(l + a+it)\ <^ K 1/a K on the right then one recovers the trivial 
bound <C K (logic) K ^ 1 for the left hand side (up to constants), and inserting any non-trivial 
information about F( 1 + a + it) supplies a non-trivial bound for the left hand side. This 
lossless quality is the crucial feature of Halasz-type theorems. 

The left-hand side in (11.101) is independent of the values of f(p k ) on the prime powers 
p k > x. Hence if we define A f±(p k ) = A f(p k ) for all prime powers with p k < x, and 
Aj±(p k ) = 0 otherwise, then f ± (n) = f(n ) for each n < x, and the Dirichlet series F- L (s) : = 
Xn>i f ± i. n )/ nS has the finite Euler product n p < c (l + f ± {p)/p s + /^(p 2 )/^ 25 + •..) which is 
analytic for all s with Re(s) > 0. One can replace F in the upper bound in (11.101) with F J_ , 
which is sometimes convenient. 
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We now describe the corresponding result in the function held setting. Define the mul¬ 
tiplicative function f 1 - by setting A f±(M) = Af(M) if deg M < n, and A f±(M) = 0 if 
deg M > n. Hence x ± ( m ) — x( m ) and cr ± (m) = a(m) for all m < n, whereas cr^n) = 
cr{n) — x{ n )/ n i n view of (11.81) . Now J~ L (z) is entire, whereas J-(z) might only be analytic 
in a disc. Paralleling (II. lOj) . we establish the following result. 


Theorem 1.1. Let f be in the class C(k) and let a(n ) be defined as in (11.61) . Then for all 
n > 1 we have 


(1.11) k ± (n) I < 

and therefore 

|cr(n)| < |cr- L (rz)| + 


K 


n 


X(?i) 


max |J r± (^/g)| 
\z\=Vt 


j.n—1 


l - 1 


dt , 


n 


< — [ ( max |J r± (^/g)|) f 1 * )dt + -. 
n ./ n V |*| =y /i /VI - t J n 


Our short proof of Theorem 11.11 is given in Section 3. Note the strong parallel between 
this theorem and the corresponding estimate (11.101) . In our bound for <r(n) above, the term 
n/n in the bound arose from the contribution of irreducibles of degree n. Correspondingly, 
in (II. 10p the error term (loglog x) K / logic includes the contribution from primes near x, but 
also includes contributions from certain other numbers, and from error terms in truncating 
Perron integrals, and these do not arise in the simpler function held setting. 

As in (11.91) . 


( 1 . 12 ) 


XHz/q)= Y, /-‘-(DF) 

F&M ^ 


Z \ de s( F ) 


= a^(n)z'" = 

n =0 


n—1 


( x(k) k 

ex p ( ~r z 


and so one can rephrase the estimate in Theorem 11.11 as 


(1.13) 


kWI < 


K 


n 


r (exp (max Reg 

Jo V V hl=^ 


xU) 


1 -t 


n—1 


l - 1 


k =1 


dt T 


K 

n 


For every hxed real number 9, the function fe(M) = f(M)e(—Q deg(M)) is also multi¬ 
plicative, with xe(k) = x(k)e(—k9), and correspondingly ag(n) = a(n)e(—n9) (throughout 
we define eft) := e 2mt ). This is analogous to the “twist” f(n)n~ ld of a multiplicative function 
/ on the integers. Note that the inequalities in Theorem 11.11 and (j 1.13 [) remain unchanged 
if we replace x by Xe and a by a e . 

The integral in Theorem 11.11 can be difficult to work with, and we now give a slightly 
weaker bound which is simpler to use. By the maximum modulus principle we may bound 
maxij = ^ |J rJ -(£/g)| by max| z | =1 \J r± (z/q)\. Moreover, if \z\ < 1 then 


n—1 


\x(k)\ 


n—1 


log I F^{z/q)\ < —j— < k Yt - Kl °g( 2ri ) 5 


k =1 


k =1 


so that \J r± (z/q)\ < (2 n) K . In Section 3.2, using these bounds appropriately in Theorem ll.il 
we show the following corollary. 


Corollary 1.2. For all integers n > 1, we have 

|cr(n)| < 2 k(k + 1 + M)e~ M (2n) re_1 , 
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where 


max|J rJ ‘(^)| =: e M {2n) 


M =1 

I I q 


Since M > 0 we deduce from Corollary 11.21 the trivial bound |cr(n)| <C re n K_1 (see Remark 
5 in Section [2] for a more precise estimate). Inserting any non-trivial lower bound for M 
will yield a non-trivial bound for |cr(n)|, which as remarked earlier is the crucial feature of 
Halasz-type theorems. 

In the integer situation, the bound corresponding to Corollary 11,21 is 

± £ /(«)«. a + MK^iog.r 1 + 

x logic 

n<x 


where 


max 

|i|<(l°gz) K 


F ± (l + it) 
1 + it 


e~ M (logx) K . 


This is usually stated with F ± (l + it) replaced by F(1 + 1/logs + if); note that these two 
quantities are of comparable size (up to multiplicative constants). 


1.3. Lipschitz-type theorems for mean values of multiplicative functions over 
function fields. Mean values of multiplicative functions in number fields vary slowly with 
x, provided one corrects by the “rotation” of the form n ld that best approximates f(n); that 
is, one replaces f{n) by the twist fe(ri) := f(n)n~‘ e where 6 maximizes |F ± (1 + id) | with 
\0\ < log x. One can show that the mean value of / up to x equals x l9 /(1 + id) times the 
mean value of fg plus a small error term. Let us restrict for simplicity to the case n = 1. 
Building on Elliott’s work jT], in [4] (and see also [2]), we obtained the bound 

( L14 ) ^ 

n<x 1 +‘t‘ n<x 

whenever < <fi < 1. In [2j we found examples showing the sharpness of (11.1411 . up 

to the factor of log 2/0. Thus the exponent 1 — 4 cannot be increased in general, and we 
say that 1 — 4 is the Lipschitz exponent for mean values of multiplicative functions over the 
integers. 

We now give analogous “Lipschitz estimates” in the function field case, again restricting 
attention, for simplicity, to functions in C(l). 


Theorem 1.3. Let f be in the class C( 1), and let a and \ be defined as in (II.611 and (II.7H . 
Let 7i > 2 be given, and let = f ±,n , y -1 , and J r± (z) be defined as before. Select 

6 G [0,1) for which |J r± (e(— 6)/q)\ is maximized. Then for any integer i with 1 < t < n, we 
have 

I / n\ ( m 2n (l°g ; 

\a e (n + £) -a e (n)\ < (-) log — + 

where ag(k) = a(k)e(—kd), and m is the smallest odd integer that does not divide £, with 
Cm ■= l/(msin(^)). 



Since agin) 


o{n)e(—nd ), Theorem 11.31 implies that 


cr(n + i)\ 


\a(n)\ 




£\ 1 ~n, 2n (logn) 0 ^ 1 ^ 

— ) l°g d- ~ c - 

n / £ n l Cm 
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Note that 1 — c m increases tol — 2/tt as m —> oo through odd values. The first term of the 
upper bound in Theorem 11.31 corresponds to the upper bound in (11.141) . but the second term 
indicates a new phenomenon, which has no parallel in the integer situation. In Section [6] we 
will construct examples, for each odd m > 1, of / for which \ae{n + i) — crg(n)| l/n 1-Cm 
for a positive proportion of integers n. For instance, if m = 3 we take y(n) = 1 if 3 divides 
n, and x( n ) = ~~ 1 otherwise. Then one can show that cr(3n) = — cr(3n + 1) i/(r(|K /3 ), 
whereas cr(3n + 2) = 0. Note here that 1 — c 3 = 

When i is large, the first term in the Lipschitz bound dominates the second, and one 
can construct examples in which the exponent 1 — 2/7T is attained. Since this situation 
is in complete analogy with the situation for integers (see 0). we do not carry out this 
construction here. 


2. Some Examples and Remarks 

In this section we collect together some remarks on our class C(k) and offer various motivating 
examples of functions in this class. The proofs of our theorems are deferred to the subsequent 
sections. 

Remark 1. The Dirichlet convolution of f\ G C(k i) and f 2 G C(k 2 ) is given by ( fi*f 2 )(F ) := 
YJab=f fi(A)f 2 (B), and lies in the class C(k 1 + k 2 ). The Rankin-Selberg convolution, 
fi x / 2 , is defined by setting A f lX f 2 (M) = 0 unless M is a prime power, in which case 
A/ lX / 2 (dT)A(M) = \f 1 (M)Aj 2 (M), so that f\ x / 2 is a multiplicative function in the class 
C(k The function fi x / 2 matches the product /i/ 2 on squarefree M, but the two func¬ 
tions differ on prime powers P k with k > 1 with the Rankin-Selberg convolution being the 
more natural choice. 

Remark 2. Let / G C(a), and let the averages a and x be as in (jL6j) and (11.71) . Note that 
x(0) = 0 an d |x( n )| < K f° r all n. We now prove the convolution relation (j 1.8 [) satisfied by 
cr and y. First note that 

/(F)deg(F) = ^A / (D)/(F/D), 

D\F 

which follows upon comparing the two sides of the relation zlF'(z) = (zip'/J r (z))J r (z). Tak¬ 
ing the average over F G M n gives 

"*(") = d ^ /(F)deg(F) = l ^ E A /( b )/(F/F) 
q FeMn q FeMn D\F 

= Eb E A /(B))(—J E /("))• 

k =1 H D£M k H MeM n -k 

In other words we have the convolution identity 

n 

na(n ) = y(/c)cr(u — k). 

k= 1 

As discussed in the introduction, this is a simpler, discrete version of the integral equation 

pu 

ua(u) = / x(t) a ( u ~ t)dt 
Jo 
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that occurs for number field mean values, which was first considered by Wirsing, and dis¬ 
cussed further in [3]. 

Remark 3. The convolution identity (1 1.8 [1 shows that cr(n), the average of / over elements 
of A4 n , depends only on the y-vafues, which are the average of / taken over suitable prime 
powers, but not on the individual f(P e )- Thus there is no loss in generality in assuming that 
f(P e ) = x(k) whenever deg (P e ) = k. We will use this observation repeatedly in the sequel 
when discussing and constructing examples. Further, this observation means that we can 
view Theorem 11.11 purely as a result in analysis: given information about the coefficients of 
the power series Xlvli x(k)z k /k, it obtains information about the coefficients of the series 
0 ^ n = eMT,ZiX(k)z k /k). 

Example 4. In the introduction, we saw that if f(P k ) = 1 for all prime powers P k then 
J-(z) = (1 — qz ) -1 , and y(/e) = 1 for all k > 1, and a{n) = 1 for all n > 0. 

Generalizing this, we consider the construction where x(fc) = a for some fixed a € C, and 
all k > 1. We find (either by solving the recurrence (11.81) . or by noting that the corresponding 
generating function F(z) equals (1 — qz)~ a ) that 

_ fa + n - 1\ _ a{a + 1) • • • (a + n — 1) 

\ n J n\ 


Therefore cr(n) ~ n“^ 1 /T(a) for large n. This is analogous to the Selberg-Delange theorem, 
which gives asymptotics for J2 n<x d z (n) where C( s ) 2 — E n >i d z {n)/n s . 

When a = k £ N, the example above deals with the fc-divisor function over Fjx], and 
(12. ip gives the average number of ways of writing a polynomial F of degree n as the product 
F\ ■ ■ ■ F k . The case a = — 1 deals with the analog of the Mobius function: f(P) = —1 for 
irreducibles P , and f(P e ) = 0 for f > 2. Finally, note that when a = —k is a negative 
integer then er(n) = 0 for all n > k + 1. 


Remark 5. If / £ C(k), then by induction using (j 1.8 1) we see that for all n > 0, 


a(n)| < 




Example 6. Smooth polynomials. A y-smooth integer is a positive integer n, all of whose 
prime factors are < y. The indicator function of y-smooth integers is the completely mul¬ 
tiplicative function / for which f(p) = 1 if p < y, and f(p) = 0 otherwise. It is known 
that for a wide range of u > 1 there are ~ p{u)y u y-smooth integers up to y u , where 
p{u) = e^d+oi 1 ))^ 10 ?^ j s the Dickman function (which equals 1 for 0 < u < 1, and is defined 
by up{u) = fy p(u — t)dt for u > 1). 

Analogously an m-smooth polynomial is one all of whose irreducible factors have degree 
< m. Consider the construction where %(£) = 1 if 1 < l < m and y(f') = 0 for i > m. If 
we determine cr(-) using (1 1.8 j) then N(n,m), the number of m-smooth monic polynomials 
of degree n, satisfies N(n,m) > cr{n)q n . Now cr(n) = 1 = p(n/m ) for 0 < n < m. By an 
induction hypothesis and (jl.8p one can then deduce that er(n) > p(n/m ) for all n, as follows: 
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the final equality holding since up{u) = f"_ f p(v)dv for u > 1. In fact there is some room to 
spare in the lower bound above, and one can show that 

a[n) > p(n/m)ex p 

for some positive constant c. This implies that in the function field case, the Dickman 
function is not a good approximation to N (n, m) if n ~ m 2 (whereas it is a good approxi¬ 
mation in the corresponding range u ~ log y for the y-smooth integer counting problem). A 
similar phenomenon occurs when we count “smooth permutations”; that is, elements of S n 
composed of cycles of length at most m. 



Remark 7. In Example 4 we saw that if x(£) = — k is a negative integer for all £ > 1 then 
a(n) equals zero for all n > k + 1. We now consider the converse situation: if cr(n) = 0 for 
all n > k + 1 (where k > 0), what can we conclude about x(T)? Our assumption implies that 

OO k 

F(z) = £V(n)( 9 *)“ = J>(n)( 9 *)“ 

n =0 n =0 

is a polynomial of degree k. Factoring T into its roots we obtain 

k 

f ( z ) =nc i —i) 

3 =1 

for some complex numbers Oij. Since log fF(z) = ^ 1 p'{ c l z Y is holomorphic in the region 
\z\ < 1/q, we see that |ay| < q, or in other words all the zeros of fF lie in \z\ > 1 /q. Further 
we have 

<ix(k) = ~J2 a r 

3=1 

If |x(^)| A k for all £, it follows that there can be at most a values of ay with |ay| = q, and 
the rest are strictly smaller than q in magnitude. 

If / G C(k) with k < 1 satisfies cr(n) = 0 for n > k + 1, then from the above we conclude 
that x(f) must decrease exponentially for large t. If k — 1, then either |ay| < q for all j, in 
which case x(^) once again decreases exponentially for large £, or \oij\ = q for some j (say 
j = 1). In the latter case, we may use Dirichlet’s theorem to find £ such that all the a £ j (for 
1 < j < k) have argument in (—7t/8, 7t/8) say, and this forces k — 1 (else one would find an 
£ with \ X {£)\ > 1). Thus in this case one must have J-(z) = (1 — qze{9 )) for some 0; in other 
words, the only possibility for f is a twist of the Mobius function by some 6. This is a simple 
analog of a striking converse theorem of Koukoulopoulos [5] for multiplicative functions over 
the integers. It may be interesting to work out a precise analog of his result, which would 
involve imposing (given / G C(l)) the weaker restriction |cr(n)| -C n” 2 ~ 5 for some S > 0, and 
deriving a similar dichotomy for the behavior of y(^). 

We proved that for any k > 0, at most of the ay can have size q. Here too, one 
would like to replace the condition that cr(n) = 0 for large n, by a weaker condition like 
cr(n) n~ A for some A = A(k). Koukoulopoulos and the third author have taken some 
first steps in this direction for multiplicative functions over the integers. 




MEAN VALUES OF MULTIPLICATIVE FUNCTIONS OVER FUNCTION FIELDS 


9 


Remark 8. One of the main results in [3] states that if / : N —?• { — 1,1} is a completely 
multiplicative function, then for large x one has ^2 n<x /( n ) — (^i + o(l))x, where 


5\ — 1 — 2 log(l + yfe) + 4 


r-V? 


log t 
t + 1 


dt = -0.656999. 


and the constant hi is optimal. The exact parallel of this result is false in the function held 
setting: the function f(F ) = (—l) de sH) is completely multiplicative, and here <r(n) = (—l) n . 
ft would be interesting to develop the right version of this result in the function held setting. 
Perhaps parity is the only substantial obstruction to such a result? 


Example 9. Let e(«i), ..., e(atk) be distinct points on the unit circle, and let a\, ..., a & be 
complex numbers, all bounded by 1 say. An interesting class of examples is given by setting 
(for n > 1) 

k 

X(n) = djei-noLj ). 
i=i 

Here, by (j 1. 91) . 

OO k 

^a(n)z n =: T{z/q) = fj(l - ze(-aj))~ ai . 

71=0 j = 1 

By matching up the coefficients of the (1 — ze(—aj))~ aj , we may construct a function 

k 

G( z ) = (CoO’X 1 - ze(-atj))- aj + C7i(j)(l - ^e(-a i )) 1_ “ i ), 

3=1 

such that F[zjq) — Q(z), and its hrst derivative are bounded uniformly in \z\ < 1 (the bound 
may depend on the e(cc :) ) , s and dj , but remains uniform as \z\ —> 1). Here, for example, 


c'o(i) = na - e ( a 3 - f p)) ai 


for each j, and the C\ (j) are given by a similar but more complicated 
hrst derivative of Tizfq ) — G{z) is bounded uniformly in \z\ < 1, it 
coefficient of T{zjq) — Q(z) is 0{l/n). Thus we conclude that 


a(n) 


(° 0 (•?') e ( ~ na i ^ + C, i(-?) e ( ~ na 3 ) 1 + 

X>(j)e(- n a 3 )M^+0(i), 

3=1 J 


expression. Since the 
follows that the n-th 



+ 0 



since each | a 3 \ < 1. 


3. Proofs of Theorem 11.11 and Corollary 11.21 

The key to our proof of Halasz’s Theorem over function helds, as well as over number helds, 
is an identity, given in Lemma [3711 below. As discussed in [2], the crucial feature of this 
identity is the presence of three generating functions in the integral on the right-hand side, 
which will allow us to bound that integral efficiently. There is a strong analogy with additive 
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number theory, where ternary problems are accessible to harmonic analysis techniques (such 
as the circle method) but binary problems are usually not. 


Lemma 3.1. Let f be any multiplicative function in the class C(k), and let T(z) be as in 
m- Let r be a positive real number with r < 1/q. Then 


E /< M ) 

MeMn 


1 

n 


E vw) + 

MeMn 



dz dt 

z n+1 T' 


Proof. By Cauchy’s formula we may write, for any 0 < r < 1/q, 

d z 

( 3 . 1} ' ' ^ 1 
Now we write 


E /w) = -f- [ z?' 


yU +1 ‘ 


zT\z) = (zy[z)^T{z) = (zyr(z)) ( x + I ^F(t z )dt) 


r 


1,1 d 


T 

r 


= (z^zj) +(4^) / 


r 


dt 


and use this expression in (13.11) . The hrst term above gives 




T' 

zC(z] 


dz 1 ^—> . \ 

z— n ^ )’ 

M&Mn 


matching the hrst term in the right-hand side of the lemma. The second term gives, upon 
interchanging the integrals over 2 and t, the other term in the right-hand side of the lemma. 


□ 


In [2], we use an analogous “triple convolution” identity in our proof of Halasz’s Theorem 
over number fields, but the key analytic technique there is Perron’s formula rather than 
Cauchy’s formula, which leads to several additional complications. 


3.1. Proof of Halasz’s Theorem in function fields. Theorem 11.11 clearly holds when 
n — 1, and so we suppose below that n > 2. Since a(n ) depends only on the values of / 
on prime powers with degree at most n, we are motivated to use the multiplicative function 
/ ± (= f ±,n ) as described in the introduction. We recall that a ± (j) = a(j) for all j < n — 1, 
and that cr ± (n) = cr[n ) — y(n)/n, and note that is an entire function for all z G C. 

Now use Lemma 13.11 with T replaced by there. From our observations above, we 
obtain (with T L '{z) denoting the derivative of T L [z)) 


(3.2) cr(n) 


xH 

n 






dz dt 
z n+1 t 


By (11.121) we see uJ r±> /J r± (u) = Y/j=i x(.3)( c L u ) : ‘ is a finite sum, so we may take the inner 
integral over z in (13. 2 1) to be over the circle with radius 1 /(qVt), and obtain 


(3.3) er(n) 


xH 

n 


n 


p n— 1 n— 1 

J|2| -W? 3=1 3=1 


dz dt 
z n+l t 
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Now consider the inner integral in (13.3]) . Using Cauchy-Schwarz we see that 


1 „ 71 — 1 71—1 

J W=-^Tt 3=1 3=1 


dz 


v77+l 


< ( qVt) n ( max |J r± (te)| 


n —1 


\z\=—^-p= 
1 1 qy/t 


2vr 


\^2x(j){q z Y 


\ Z \~qs/t 3 = 1 


\dz\ 


(3.4) 


x 


i r \ n ~ 1 

2 nL, |ETJ)(«‘T 

3 =i 


\dz \\ 3 


By Parseval, and since \x(j)\ < « for ah j, we have 


(3.5) 


1 

27T 


„ 71—1 

'!*!=« 1 J= i 


: |ob| 


n —1 


n—1 


3=1 3=1 


Inserting this into (13.4[) . we deduce that (13. 4 p is 


< n 2 (qVt) n 


max 

z\= Vl 

I I q 





K 2 q n t 


in —1 


max |-7 r± (^) 


hl=— 

I I q 


Inserting this into (13.31) yields Theorem 11.11 


3.2. Proof of Corollary 11.21 As mentioned in the introduction, the maximum modulus 
principle gives 


max |J r± (^)| < max|J r± (^) 
M=f 1*1=5 


= : e“ M (2 n) K 


for 0 < t < 1. Moreover, by definition when \z\ < 1/q we have 


71 — 1 


log |^ W | = Re( £ ^(9*)*) < * E ^ ^ K E ^ = -Ml - )■ 


k =1 


k =1 


fe>l 


Therefore 

(3.6) 


max |.r (z)| < min 

z[= Vt 


(e~ M (2n) K ,(l-Vi)- K y 
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Taking t = (1 — u) 2 , and using (13.6f) . we obtain 


k~ I I max | J- (z) 
'o V M=^ 


1 - 1 


n— 1 


l - 1 


dt 


p 1 -j 

< k 2 / min (e^ M (2n) K ,u~ K ^\ min ((n — 1), —-r) (2(1 — u))du 

L V / V u(2 — u)J 


min (eT M (2n) K , u~ K ^j min ^2 n, —^jdu 

re M/K/ 2n 


1/2 n 


<k 2 [ / 2 ne m ( 2n) K du + 


e~ M (2n) K — + 


du 


' 1/2 n 


n 


— K ,— 1 


^ Je M / K /2r 


2l i , e (^ n ) K ^ 

n 


= n 1 e~ M (2n) K + e -iW (2n) K — + 
V hi 

= Ke~ M (2n) K (K+ 1 + M) - k. 


Substituting this bound into Theorem 11.11 yields Corollary 11.21 



4. LlPSCHITZ ESTIMATES: A KEY PROPOSITION 

Throughout this section, we restrict attention to / G C(l), and prove an appropriate modi¬ 
fication of Corollary 11.21 to bound the difference | < 7 , 9(71 + t) — ag(n)\. 

Proposition 4.1. Let f be in the class C(l), and let <7 (77) and x( n ) be defined as in (11.61) 
and oai. Let n >2. Define, for a given 9 G M/Z, 


n—1 


L{n , L, 9) = max (1 - z e ) exp ( V xe(j) —) 
W =1 


For any integer £ > 1, we have (recall og(k) = a{k)e(—k9)) 

t „\ , w £ L{n,£\Q)( , 

\a e (n + t) — <7 g (71) C — I- ( 1 + log 

71 77 V 


277 


T(77, 0) 


We will apply this result for a suitable choice of 9 in the next section, so as to deduce 
Theorem 11.31 


Proof. Let X ± {k), o^ik) and ^ r± (-) be defined as in the introduction, so that X ± (k) = x(k) 
when k < n — 1, and X ± (k) = 0 for larger k, and 

7- L (2/9) = f^<T 1 (nL”=exp( V Lhz l ). 

n =0 k<n —1 

If k < 77 — 1 we have cr^/c) = cr(/c), and hence, using (j 1.8 [) . if 1 < i < 77 then 

n+£ 

(77 + H)|< 7(77 + £) — < 7^(77 + £)| < ^ |x(/c)cr(77 + ■£—&) — X _L (fc)<7 _L (77 + £ — fc)| < 2(2£ + 1), 

k=1 
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since the terms cancel out unless k < i or k > n, and as |x(-)|, |x _L (-)|, |cr(-)|, |o- L (-)| are all 
at most 1. Therefore we have 


(4.1) 


n | ag(n + t) — crg(n)\ = n \cr(n + £)e{—£9) — cr(n)\ 

< |(n + £)a{n + £)e(—£9) — na(n )| + £ 

< | (n + + £)e(—£9) — na ± (n)\ +5^ + 4. 


Following the above tidying up, we switch to our main analytic argument and apply 
Lemma [3.1 1 to the first term on the right hand side, obtaining 

(n + £)a ± (n + £)e(—£9) — ncr _L (n) = 


(4.2) 


2ni 


'\z\=r 


(^yrC 0 )) (^yr(^))- ?r± (^) 


,_i_ e(—£9)(qz ) e — 1 dz dt 


(qz) r 


z t 


As before we take the inner integral to be over the circle with radius r = l/(qy/t). Using that 


uJ-’ ±l /J~' ± (u) = J2j=i x(j)(q u Y, and by Cauchy-Schwarz, we can bound the inner integral 
by 


maxi J r± (tz)(e(—£9)(qz) t — 1 )Ix 
bl=77* 


(4.3) 


t 2 


1 

27T 


„ n —1 

/ I ^2x(J)(q z ) j 


2 \dz\\h / 1 
V27T 


n —1 


'* 1 = 77 ! 1 1=1 


\dz\\ \ 


Using the Parseval bound of (13. 5 p (with k = 1 there) and the display immediately following 
(13.5ft . the second line of (14.3h is 


< t 


1 -t 


n— 1 


l - 1 


< t min < n, 


1 -t 


To bound the maximum on the first line of (14.31) . we first let w = qtz so that 

max \J r± (tz)(e(—£9)(qz)~ c — 1)\ = max \J r± (w/q)(e(—£9)(w/t)~ { — l)\ 

W=^ \w\=Vt 

= max \J r± (w/q)(e(£9)w e — 1) j, 

\w\=V~t 

where the final equality holds because (w/t) -1 is the complex conjugate of w when |w| = y/t. 
By the maximum modulus principle, this is 


< max | J r± (w/q)(e(£9)w e — 1)| = max \e(—£9)w 1 — 1| • exp (Re( 

|iu|=i |iw|=i V V ^ J 


k<n —1 


k 


Writing w = e(—9)z, we see that this is L(n, £\ 9). Since |(n)| < (1 — \qz\) 1 for \z\ < l/q, 
we also have the alternative bound 

max \J r± (tz)(e(—£9)(qz)~ e — 1)| < 2 max \J r± (tz)\ < 


l*l=^T 


l*l=^T 


i - Vt' 


Inserting these bounds into (14.3p and then (14.2p yields 
\(n + £)a^{n-\-£)e(—£9) — n<j L (n)\ < [ minjn 


’ 1 -t 


nun 




Xdt. 
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'/ 2n f 1 / L L 2 

2 nLdu + / — du + —du 

0 Jl/2n u Jl/L U 


Arguing as in the proof of Corollary 11.21 we set t — (1 — w) 2 so that the integral is 

f 1 r 1 'i f 21 f 1/2n /,1/i r />1 

< / min < 2 n, — > min \ L, — \du < 

Jo 1 u j V u J J o 

= L + L\og(2n/L) + 2L — 2 

where L = L(n,£] 0). Inserting this into (14.1H . we obtain 

n \(Te{n + i) — ag(n)\ < 5£ + 3L + Llog(2n/L) + 2, 
and the result follows. 


□ 


5. LlPSCHITZ ESTIMATES: PROOF OF THEOREM 11.31 

In the previous section we estimated \<Jg(n + (!) — ag(n) \ in terms of the parameter L(n, £; 6) 
defined in Proposition 14.11 In the next lemma we show that if we choose 9 such that 
R e (Sj=i Xe(j)/j) is maximized (which is the same as choosing it such that |J r± (e(— 9)/q)\ 
is maximized), then 


n —1 


L(n,£]6)< max L*(n, £\ a) where L*(n, £; a) := |1 — e(£a)\ exp ( 
ae[0,l) V 


k=1 


cos(7t/cq:)| ^ 


k 


We then proceed to give accurate estimates, up to a constant, for each L*(n,£;a ) and, 
optimizing, deduce Theorem 11.31 

5.1. Determining what is to be optimized. 


Lemma 5.1. Select 6 so as to maximize R.e(^'.' =1 1 Xe(j)/j)- Then 


n—1 


(1 -z‘) exp(^x»(i)4 

j-i 1 


n—1 


< |1 — e(£a)\ exp ( ^ 

z=e(a) \ ^ ' k / 

’ k =1 


cos(irka)\ \ 


Proof. If z = e(a) then, by the definition of 9 as the maximizer, 


n—1 


n—1 




3 = 1 


Xo(J) 


3 =1 
n—1 


n—1 

£ 

3 =1 


Xe{j) i 


Re / £ X.(jMW2) ms(wJ A < g |cos(7rfca)| 


1=1 


fc=l 


k 


This proves the lemma. Note also that equality holds in the last step above when y(fc) = 
e(k(9 — a/2)) sign (cos (nba)), and so the lemma is sharp in general. □ 

5.2. Upper bounds for a given a. 

Lemma 5.2. Suppose n > 2 and a G [0,1) are given. Let R := [log n], and select m < 2R 
such that |a — h/m\ < l/(2mi?) for some ( h,m ) = 1. Then 


L*(n,£;a) x \\£a\\ n* ( min{n, l/||ma 


m 


0 ( 1 ) 
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where ||i|| is the distance from t to the nearest integer, and 


m— 1 


Cm • — 


m 


| cos(7ra/‘ 


m = 


a =0 


cosec(7r/2 m) r ■ 

-——- it m is odd , 

m J 

cot(7r/2 m) r 

———- it m is even. 

m J 


An alternative expression for c m is 


Cm. 


^- 2 E 


r> 1 
m\r 


(-1 ) r 
4 r 2 — 


Proof. The function | cos(7rf)| is periodic with period 1, and a little computation gives the 
Fourier expansion 


cos(7rt)| = ^ 


(-l)' r+ , v 2 ^ 

Art) = - - 


-l ) 1 


rEZ 


2tt (r 2 — 1/4) 


7r Af 2tt (r 2 — 1/4) 


(2 cos(27rrt)). 


Therefore 

(5.1) 


n— 1 

E 

k= 1 


C0s(7r/CQ()| 

k 


2 /£-4 1 


l) r cos(27t kra) 


E -E 


7r v x ' k •4—' 4r 2 — 1 

fc=l r>l 


k 


The first sum is = logn + 0(1), and all subsequent sums over k are -C log n, so we may 
truncate the r-sum at r < R, with an error of 0(1). 

Let S(x) := J2 k<x cos(27r kra), which is easily seen to be -C l/|[ra||. By partial summation, 
we deduce that if K l/||ra|| then 


E 

k>K 


cos(2nkra) 
k 


' K 


dS(t) _ S(K) 
t ~ K 


Ik 


t 2 


dt -C 


K\\ra\ 


< 1. 


Therefore 


n -1 , n , \ min{l/||ra||,n} 

~~—f: ~ = T + 0(1) = log(min{l/||ra||,n})+ 0(1). 


k =1 


k =1 


k 


For each r < R we have |ra — r&/m| < r/2mR < 1/2m. Therefore if m { r then 
||m|| > 1/2 m, and so log(min{l/||ro;||,?z}) = log(l/||r&/m||) + 0(1). Since this is <C logm 
we see in particular that the terms in the r-sum, for which R > r > logm and m \ r, 
contribute 0(1). Therefore the contribution of the terms r < R with m \r to (15. ip is 


(5.2) 


4 

7T 


E 


-i)’ 


l<r<log m 


4 r 2 — 1 


log 


\rb/m\ 


+ 0(1) = 0(1 + logm). 


Note also that if r < R and m\r then ||ra|| = (r/m) ||ma;||, and so the contribution of 
these terms to (15.ip is 


(-i)’ 


-- E 

7r ^ 4r 2 — 1 

l<r<i? 

m\r 


log min ( n, 


r/m)\\ma 

Using these observations in (15.ip . we deduce the lemma. 


) + 0(1) = (c m - log min fn, 1 ) + 0(1). 

/ V 7T/ V ma / 


□ 
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Remark 10. From (I5.2[) . we see that the m 0 ^ term in Lemma [5.21 can be replaced by the 
more precise expression 

ex p(-^ S 

l<r<l°g m 

where (t) m is the least residue of t (modm), in absolute value. This can be shown to be <C 
m 1 " (which is attained when 6 = 1 ) and 3 > mEi (which is attained when 26 = 1 (modm)). 


Corollary 5.3. Let m 0 be the smallest odd integer that does not divide i. Then 

max L*(n,P,a) x max{n Cra °m?^,n’f 1 "*}. 

« 6 [ 0 , 1 ) 


Proof. Observe first, for use later, that c± — 1 and c 2 = 1/2, and that the c m tend upwards 
to 2/n as m varies over even values, and they tend downwards to 2/tt as m varies over odd 
values. Note also that if m is odd we have - + > c m >- + -%, for certain absolute 

constants C, c > 0. 

Let 0 < a < 1, and correspondingly choose 1 < m < 2 [log n\ as in Lemma [5.21 First we 
shall establish the upper bound implicit in Corollary 15.31 

In the case where m is even we have c m — ^ < 0, so Lemma [5.21 directly implies that 

L*(n,P,a) <C \\£ce\\ n*m°^ < n* m 0(yl \ 

Since m 0 is odd we have c mo > ^ + ^, and since also m,m 0 -C log n we conclude that 

L*(n,£;cn) -C n Cm om^ (1 ^ if m is even, as required. 

In the case where m is odd we have c m — | > 0, and we need to divide into some sub-cases. 
Firstly, if m > mo then c m < c mo , and so Lemma [5.21 implies similarly as before that 

L*(n,£;a) <C ||^a|| n Crn m°^ < n Cm m°^ <C n Cm °m^ l ' > . 

If m is odd and m < mo then we must have m \ £, by definition of mo- Now there are 
two further sub-cases. Firstly, if ||ma|| < \j£ then we use the bound \\£a\\ < (£/m)||ma|| < 
fj|ma|| in conjunction with Lemma [5.21 obtaining 

L*(n,£]a) < \\£®\\ n^\\ma\\^~ Cm m°^ < £n^\\ma\\ 1+ ^~ Cm m 0< ' 1 ' > < £ Crn ~^m 0( ^\ 


since \\ma\\ < l/£. Then since m < m 0 log(2T), and c m < C 3 = 2/3 unless m = 1, this 
bound is -C which is acceptable. 

The remaining sub-case is where m < mo is odd but ||ma|| > l/£, in which case Lemma 
15.21 gives 


L*(n, £;a) <C ||£a|| ||ma:|| Crn m° < ' 1 ' > < n*\\ma\ 


m 


< n^£ Cm *m 


which is acceptable as in the previous sub-case. 

We complete the proof by establishing the lower bound implicit in the corollary: namely, 
max Qg [o,i) L*{n,£\a) maxjn^m^^n*/ 1- "}. We show that the first term in the max is 

attained when a = 6 /mo with ( 6 , mo) = 1 , and the second term in the max is attained when 
a is roughly of size l/£. Indeed, if a = 6 /mo f° r some ( 6 , mo) = 1 then Lemma [5.21 (with 
m = mo) gives 


L*(n, i\ a) 



n c "*»m 0 °^ xn Cm °rn^P, 


since m 0 does not divide £. 
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This suffices, unless £ is at least a large multiple of logn (otherwise n^i 1 * < n c ”om^, 
since c mo > | + -ffz > | + log2 c f y ). If £ is at least a large multiple of logn, then for any 

l/(4f?) < ||a|| < l/(2£) we can take m — 1 in Lemma [5.21 and obtain 

L*(n,£',a ) x ||Rr|| n* ||a||F _C1 x £n* ||a|| 1+ F _C1 xnif 1 ^*, 

since C\ — 1. □ 

Proof of Theorem \1.3[ As noted earlier, if 0 in Proposition 14.11 is chosen as in Lemma 15.11 
then L(n, £\ 6) < max ae [ 0j i) L*(n, £; a). Now using the bound of Corollary 15.31 in Proposition 
14.11 we obtain Theorem 11.31 We record that, using Remark 10, one can replace (log n) 0( -b 
in the statement of Theorem 11.31 by (logn) 2- ^. □ 

6. Final examples 

We now give examples which establish that the upper bound in Theorem 11.31 is attained for 
each fixed odd m. These are based on the discussion of Example 9, with a suitable choice of 
the points ctj there. 

Suppose that for integers k > 1, the function x(k) is periodic (modm). Write 

1 m b 

X(j) = — ^X(k)e(- —), 
m V m / 

k =1 

so that for all n > 1 we have 

m 

3 = 1 

This class of examples satisfies the hypothesis of Example 9, and so the corresponding 
solution cr(n) is given by the generating function 

m 

J>(n)*" = ]^[(1 — ze(j / . 

n>0 j= 1 

Arguing as in Example 9, we may find asymptotics for a(n). 

We now consider the special case where m > 1 is odd, and x(fc) = sign(cos(27rfc/m)) for 
all k > 1. Thus x(fc) = 1 when ||/c/m|| < \ and x(k) = —1 otherwise. Recall from Remark 
3 above that we are free to construct examples simply by specifying the behaviour of x(fc). 
Since xik) = xi m ~ k) for all 1 < k < m — 1 we see that 

/ \ m— 1 m 

x(j) = 1 -- H-V cos{2njk/m)x{k) = — V] cos(2tt jk/m)x(k). 

m m z —' m z ' 

k =1 k =1 

In particular 

^ m 

X(l) = x{m - 1) = — V] \cos(2nk/m)\ = c m , 

k =1 

whereas x(j) is a real number smaller than c m for all other values of j (because the values x(fc) 
no longer perfectly “resonate” with the coefficients cos(27t jk/m)). Therefore, by Example 
9, we have 

a(n) = (C'o(l)e(n/m) + C' 0 (m-l)e(-n/m) + o(l))——- = C[ cos (- h/3) +o(l)j——- 

V J r(c m ) V V m J ) r(c m ) 




















18 


ANDREW GRANVILLE, ADAM J HARPER, AND KANNAN SOUNDARARAJAN 


for a suitable non-zero real number C, and some (3. 

Now Theorem 11.31 asserts that for any 1 < £ < n such that m is the smallest odd integer 
not dividing £, we will have 

| a e (n + £) — <Jg(n)\ = \a(n + £)e{-£6) - a(n) | < f-) " log ^ -, 

\n/ £ n 1_Cm 

for a suitable 9 e [0, 1). We now show that we will have \cr(n + £)e(—£9) — a(n) | 1 /n l ~ Cm 

for some (in fact a positive proportion of) n values, for any such fixed £, so that the second 
term in the upper bound is sharp up to logarithmic factors. Indeed, for all 9 we have 


| a(n + £)e(—£9) — cr(n)\ > | a(n + £) \ — |cr(n)| 

n‘ 


C 


c ™~ 1 ' '2n (n + £) 


r(c„ 


cos 


m 


+ P 


cos 


'27t n 
V m 


+ P 


+ o 


This will be ^> m 1/n 1 Cm unless + f3) = ±( 27r ^ + ^ + f3) (mod7r), and since m is odd and 
m \ £ that can only happen if (— +/3) = — ( 27r( ? " +f ^- + /3) (mod 7r), which (since m is odd) can 
only happen for those n in one particular residue class (modm). This justifies the remarks 
made at the end of the introduction concerning the optimality of the exponents appearing 
in Theorem 11.31 

To give a more explicit example, when m = 3 the analysis of Example 9 gives, for oj = e(|), 


^o(n) z ' 

n> 0 


(1 - z ) 1 / 3 

((1 — uz){l — U^)) 2 / 3 


l-z 

(1 — ^3)2/3’ 


and indeed that cr(3n) = —cr(3n + 1) = T(n + |)/n!T(|) ~ n 1 / 3 /T(|), and a(3n + 2) = 0. 
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